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It is widely agreed that the human face is processed differently from other objects. How- 
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of clarifying what types of information are included in the holistic representation of a face. 
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In the field of visual perception, it is generally agreed that faces 
are processed differently to most other objects in that they are 
processed "holistically." However there is a lack of consensus and 
clarity in the literature regarding what is meant by holistic process- 
ing and how it is different from the part-based processing most 
commonly attributed to the perception of non-face objects. Dis- 
cussions of "whole" and "part" processing are in fact common 
across different areas of perception (i.e., visual, auditory, tactile), 
however it is not often made clear what constitutes a part and 
whole and how to differentiate them (Latimer and Stevens, 1997). 
This review article aims to address the lack of consensus and clarity 
in what is meant by parts and wholes in visual processing of faces 
and other objects. To do so we will first discuss the basis of the term 
"holistic" in Gestalt psychology and, through doing so, discuss how 
face perception may be qualitatively different from the perception 
of other objects. Secondly, we will discuss different ways in which 
configural and holistic processing have been conceptualized in the 
face perception literature and draw attention to alternate views 
of what is included in the "holistic representation" of a face (e.g., 
Maurer et al., 2002; Rossion, 2008; McKone and Yovel, 2009; Yovel, 
2009). This will be followed by an overview of direct and indi- 
rect measures of holistic processing and a brief discussion about 
other aspects of face (e.g., attractiveness judgments; Abbas and 
Duchaine, 2008) and whole body perception (e.g., Robbins and 
Coltheart, 2012a,b) to which measures of holistic processing have 
been successfully applied as well as a brief overview of the develop- 
mental aspects of holistic processing (more detailed reviews can 
be found in McKone et al, 2012). We will finish by discussing 
how using moving faces in conjunction with common measures 
of holistic processing might help to clarify some of these issues 
and allow facial processing to be explored in a more naturalistic 
way. 



HOW DO WE PERCEIVE OBJECTS AND FACES? 

It is generally agreed that most objects are processed on the basis 
of their individual parts or components (e.g., Biederman, 1987). 
A part-based perceptual model is useful as it allows: objects which 
cannot be seen in their entirety, from a single viewpoint, to be 
recognized based on their visible components; objects capable of 
engaging in non-rigid motion to be recognized easily regardless 
of part configuration (e.g., when rotated) and; objects to be per- 
ceived in a similar way to how we usually describe them (e.g., "that 
dog has pointy ears!"; Hoffman and Richards, 1984). However 
faces are thought to be processed in a qualitatively different way to 
most other objects (including objects of expertise; see Robbins and 
McKone, 2007; McKone and Robbins, 201 1 for reviews). Evidence 
of this can be seen in behavioral studies where modified facial stim- 
uli produce specific effects while other objects, that undergo the 
same manipulation, do not (e.g., Tanaka and Farah, 1993; Robbins 
and McKone, 2007). The difference has also been demonstrated 
in neuroimaging studies that have located specific face process- 
ing areas of the brain (see Kanwisher and Yovel, 2006 for review) 
and neuropsychological studies that have shown a double disso- 
ciation between specific impairments in the recognition of faces 
(i.e., prosopagnosia) and non-face objects (i.e., object agnosia; see 
Duchaine et al, 2006 for a review). 

Face perception is different to the perception of other objects in 
that it is more "holistic." Holism is the central premise of Gestalt 
theory which argues Gestalts are sensory wholes that are qualita- 
tively different to the sum of their individual parts or components 
in that they "possess properties that cannot be derived from the 
properties of their constituent parts" (Wagemans et al., 2012b, p. 
2). These properties are referred to as emergent features; an exam- 
ple of an emergent feature is the area of a square. A square possesses 
an area because its basic components (four lines of equal length) 
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FIGURE 1 | An interpretation of a relational/configural model of face 
perception integrating the Diamond and Carey (1986) and Rhodes 
(1988) models. The model starts with first-order (isolated) features (that are 
processed in a part-based manner), moves on to the emergence of 
second-order (relational) features (that are subject to configural processing), 
and finishes with increasingly complex higher-order features (attributions) 
which can involve a combination of first and/or second-order features. 



form an enclosed area that none of the lines can possess on their 
own. In the face perception literature terms such as configural, 
relational, and holistic are used to describe the emergent features 
of a face that only become apparent when two or more of its basic 
features (e.g., the eyes, nose, or mouth) are processed at the same 
time. What follows is a more comprehensive description of the 
wide array of terms used in the face perception literature. 

MODELS OF CONFIGURAL/RELATIONAL AND HOLISTIC 
PROCESSING 

Face perception differs from the perception of most other objects 
in that it relies heavily on emergent features (the interrelations 
between the more salient features of a face) as well as the fea- 
tures themselves. These emergent face features are often referred 
to as relational (e.g., Diamond and Carey, 1986) or configural 
information (e.g., Bartlett et al., 2003). These terms are used 
interchangeably here as we consider them to have fundamen- 
tal underlying similarities and in the following sections we will 
attempt to show how these two terms have been used in similar 
ways. We will also describe how these terms relate to the concept 
of holistic processing. 

Faces possess two types of relational/configural information: 
the first-order relational properties or first-order configuration 
refers to the basic configuration of the features within the face (e.g., 
eyes above nose, nose above mouth) while the second-order rela- 
tional properties or second-order configuration refers to variations 
in the spacing between and positioning of the features (Diamond 
and Carey, 1986). The first-order configuration is thought to be 
important for detecting a face while the second-order configu- 
ration is important for discriminating between individual faces 
(e.g., Diamond and Carey, 1986; Tsao and Livingstone, 2008). It is 
the second-order configuration that is normally referred to when 
discussing relational/configural information. Sensitivity to face- 
like first-order configuration seems to be present from birth (e.g., 
Johnson et al, 1991). 

A relational/configural model of face perception (e.g., Diamond 
and Carey, 1986; Rhodes, 1988; see Figure 1) is hierarchical in 
nature as it suggests that different types of judgments that can 
be made from faces (e.g., identity, expression, and attractiveness) 
can vary in the amount of second- order relational information 
needed to make them. According to Diamond and Carey (1986), 
information that can be used to discriminate between faces (i.e., 
identity) can be placed on a continuum ranging from isolated 
to relational features. Facial information that is relatively iso- 
lated (e.g., hair color) can be focused on without attending to 
information from other parts of the face. On the other hand, 
facial information that is acquired through processing two or 
more parts of a face simultaneously is said to be relational (as 
noted above, e.g., metric distance between the eyes). In a similar 
model proposed by Rhodes (1988), the cues used to discrimi- 
nate individual faces are broken down into first-order, second- 
order, and higher-order features. First- order features, like isolated 
features, are those most salient features that can be processed 
independently of others (e.g., eyes, nose, mouth). Second-order 
features, like features at the relational end of Diamond and Carey's 
(1986) continuum, are configural in nature and refer to indi- 
viduating information acquired through processing two parts of 



a face simultaneously (e.g., spacing between the eyebrows and 
hairline, i.e., the forehead); while higher-order features (attribu- 
tions) require a combination of several first-order and/or second- 
order features (e.g., age; Rhodes, 1988). The relational value of 
these higher- order features could be argued to vary depending on 
the number of second-order features included in them. Config- 
ural processing could best be described as the integration of all, 
or just some, of this second-order configural information within 
the face (Leder and Bruce, 2000; Bartlett et al, 2003). Individual 
pieces of configural (spacing) information may remain relatively 
variant or invariant while an individual face engages in differ- 
ent types of movement. For example the distance between the 
inner (or outer corners) of both eyes would remain fixed in 
cases of non-rigid (elastic) motion in the face but would change 
considerably with changes in facial viewpoint. The hierarchical 
nature of the relational/configural model suggests that configural 
processing is inseparable from part-based processing as the emer- 
gent features within a face arise from the interrelations between 
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isolated features. It also suggests that configural processing is more 
complex than part-based processing. 

A third important term in the face perception literature is holis- 
tic processing. In its most pure sense, the term holistic implies the 
processing of an object as a series of templates, each of which 
cannot be broken down into parts or the interrelations between 
them. Face processing has been conceptualized this way (Tanaka 
and Farah, 1993; Farah, 1996). Supporters of this view argue that 
the perceptual processes used for object and face perception are 
dichotomous; faces are perceived as undifferentiated wholes while 
objects are processed on the basis of their individual parts (see 
Figure 2A). A major problem with adopting this purely holis- 
tic model of face perception is that in order to account for the 
many changes that can occur within a single face (e.g., viewpoint, 
expression, hairstyle) a multitude of templates would be needed 
of that individual's face, which would require a very high memory 
load and may have implications for other cognitive and perceptual 
processes (Hoffman and Richards, 1984). There is also evidence to 
suggest that part-based processing does contribute to face recogni- 
tion and that this can be assessed independently from some kind of 
configural/relational processing (as previously defined; e.g., McK- 
one, 2004; Goffaux and Rossion, 2006). Overall then, a model 
of holistic processing with no decomposition into parts does not 
seem well supported. 

More commonly in the face perception literature, the term 
holistic implies a specialized form of processing that involves the 
integration of all the information in a face, but does not preclude 
part-based processing (e.g., Rossion, 2008; McKone and Yovel, 



2009). This will be referred to here as a holistic/part-based model 
(see Figure 2B) in which part-based and holistic processing act 
in parallel and both make important contributions during face 
perception. Advocates of this model believe that the holistic com- 
ponent of face processing can be isolated from the part-based 
component. Support for this has been found in studies where test 
stimuli are displayed in a subjects' peripheral vision (e.g., McK- 
one, 2004) or filtered to only include low spatial frequencies (e.g., 
Goffaux and Rossion, 2006) in conjunction with commonly used 
measures of holistic processing described below. Such techniques 
lead to feature details and boundaries becoming degraded, result- 
ing in a greater reliance on the use of holistic information in 
discriminating between faces (Sergent, 1985). Interestingly, larger 
effects are found on measures of holistic processing when these 
degraded stimuli are used, lending support to the idea that holis- 
tic processing does not necessarily require detailed information 
about the features of a face (McKone, 2004; Goffaux and Rossion, 

2006) . 

In the holistic/part-based model, holistic and configural pro- 
cessing are sometimes equated to mean the same thing (e.g., 
McKone and Yovel, 2009), implying that all of the configural 
information in a face must be processed/integrated at the same 
time. We favor a holistic/part-based model as it emphasizes that 
holistic and part-based processing are both separable and par- 
allel processes that are equally important to face perception. 
In line with other advocates of this model the terms config- 
ural and holistic will be taken to mean the same thing here but 
only when configural processing involves the integration of all 
(as opposed to just some) of the spacing information between 
features. 

However even within the holistic/part-based model there is 
a lack of consensus as to what constitutes the holistic repre- 
sentation of a face. Some versions view holistic processing as 
only including spacing differences between the features them- 
selves, without including information about feature shape (which 
is instead processed in a part-based manner; Rossion, 2008). 
However recent reviews argue that there is a large body of evi- 
dence to suggest that feature shape information, instead of being 
coded separately, is also included in this representation (McK- 
one and Yovel, 2009; Riesenhuber and Wolff, 2009; Yovel, 2009). 
Of course this has implications for the way in which the holis- 
tic representation of a face should be conceptualized in the face 
perception literature. In a conceptual model that is not inclusive 
of feature shape, feature center-points would be the only reli- 
able markers for calculating configural information (McKone and 
Yovel, 2009). However these findings suggest that, feature spacing 
differences are calculated from key points surrounding the fea- 
tures themselves. These points are referred to as landmark (e.g., 
Rajapakse and Guo, 2001) or fiducial points (e.g., Tong et al, 

2007) . Alternatively it may be that the metric components of 
the holistic representation of a face are encoded more implic- 
itly and do not rely on featural boundaries (McKone and Yovel, 
2009). 

Familiar (e.g., Young et al, 1987) and unfamiliar (e.g., Tanaka 
and Farah, 1993) faces have been used extensively with different 
measures of holistic processing. The next section contains a review 
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FIGURE 2 | Two different models of holistic processing. (A) A face 
perception model adapted from Farah (1996). Object and face perception 
are independent of one another. Faces are perceived as undifferentiated 
wholes while objects are processed on the basis of their individual parts. 
(B) A holistic/part-based model of face perception. Holistic/configural and 
part-based processing work in parallel, making separate contributions to 
face perception that can be assessed independently of one another. 
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of these different measures and how they have been used to explore 
holistic processing for faces. 

AN OVERVIEW OF INDIRECT AND DIRECT MEASURES OF 
HOLISTIC PROCESSING 

Although the exact nature of holistic processing is still under 
debate, it is generally agreed that there exist a variety of para- 
digms that can be used to both measure holistic processing and 
manipulate the extent to which it is used (see McKone, 2010 or 
Tanaka and Gordon, 2011 for review). These measures fall into 
three main categories: indirect measures such as the dispropor- 
tionate inversion effect; commonly used direct measures such as 
the composite task and part-whole task and; alternate measures of 
holistic processing that are not commonly used. These measures 
have been primarily used in conjunction with tasks that involve 
participants making identity related judgments about faces but 
there is also evidence to suggest holistic processing may be nec- 
essary for making other important face related judgments (e.g., 
expression and attractiveness) as well as identity judgments about 
bodies. 

THE DISPROPORTIONATE INVERSION EFFECT: AN INDIRECT MEASURE 
OF HOLISTIC PROCESSING 

When faces and objects are turned upside down they become 
harder to recognize. This inversion effect is significantly larger 
for faces than it is for most other objects (Yin, 1969). However 
because this effect is disproportionate (i.e., larger for faces, but 
still occurs for objects), it cannot tell us whether there are qual- 
itative differences between face and object processing (Valentine, 
1988, 1991; McKone, 2010). For this reason the disproportionate 
inversion effect cannot be considered to directly measure holistic 
processing. One study has also shown indirect evidence of holis- 
tic processing in making attractiveness judgments, by showing the 
negative effect that inverting a face has on making reliable judg- 
ments about this type of attribution (Santos and Young, 2008). 
Face-sized inversion effects have also been shown for human bod- 
ies, at least with heads (Reed et al, 2003; Minnebusch et al., 2009; 
Yovel et al, 2010; Robbins and Coltheart, 2012a). 

THE COMPOSITE TASK AND PART-WHOLE EFFECT: COMMON DIRECT 
MEASURES OF HOLISTIC PROCESSING 

The following two paradigms assess and manipulate holistic pro- 
cessing directly and can be considered central to the face per- 
ception literature as they have also been used to show qualitative 
differences between upright and inverted faces and between faces 
and other objects (McKone, 2010). 

The composite task (Young et al., 1987; Hole, 1994; Le Grand 
et al, 2004), involves the use of stimuli created by joining together 
complementary halves of two different faces (usually the top and 
bottom halves). Two variants of the composite task are typically 
used depending on whether the faces are familiar or unfamiliar 
to the participant. In the familiar face version of the task, partici- 
pants are asked to identify one half of a single composite face while 
ignoring the other (Young et al., 1987; see Figure 3A). Aligned face 
halves create the illusion of a new identity, making it harder to 
attend to one half of the face while ignoring the other. However 
this effect disappears when face halves are misaligned (vertically 



off-set). The unfamiliar face version of the composite task usu- 
ally involves participants identifying whether the matching half of 
two composite faces (e.g., two top halves) belong to the same or 
two different individuals (e.g., Hole, 1994; Le Grand et al, 2004; 
see Figure 3B). There is some dispute over how holistic process- 
ing in this same/different version of the composite task should 
be measured. The original version uses differences in accuracy 
and/or reaction time between aligned and misaligned "same" trials 
to measure a composite effect (e.g., Le Grand et al., 2004; Robbins 
and McKone, 2007; Rossion and Boremanse, 2008). In this version 
the half-to-ignore is always from different people, making the pre- 
dictions for "same" trials clear but the predictions for "different" 
trials unclear, which is why they are excluded. Specifically if two 
same facial halves are aligned with facial halves of two different 
people they will be perceived as looking less similar compared to 
when these composites are misaligned and integration does not 
occur. The extent to which the halves-to-match on different tri- 
als appear to be more or less similar is dependent on how alike 
the halves-to-be-ignored are. If different halves-to-ignore are very 
dissimilar, then when aligned with different halves-to-match they 
may actually look even more different than when misaligned. How- 
ever if halves-to-ignore are very similar, then when aligned with 
different halves-to-match they may actually look more similar than 
when misaligned. As this likeness is almost impossible to control 
for, the data obtained from different trials is not used to measure 
for a composite effect when this approach is taken (McKone and 
Robbins, 2007). 

An alternate version of the composite task uses both same and 
different trials to measure a composite effect (e.g., Gauthier et al, 
1998). According to this method the extent to which an observer 
perceives two facial halves to be the same or different depends on 
whether the half-to-be-ignored requires the same decision (con- 
gruent) or a different decision (incongruent), resulting in four 
different ways in which composite stimuli can be paired when 
using this paradigm (half-to-match and half-to-ignore both same 
or both different, half-to-match same but half-to-ignore different, 
or half-to-match different but half-to-ignore same; Gauthier and 
Bukach, 2007). The measure of holistic processing is then usu- 
ally taken not as the difference between aligned and misaligned 
trials, but as the difference between congruent and incongruent 
trials. The advocates of this version have argued that, for unfa- 
miliar faces, the original composite effect is mainly decisional not 
perceptual (Richler et al., 2008). However, this alternate measure 
leads to bizarre results in which misaligned novel objects are appar- 
ently "holistically" processed (e.g., Wong et al., 2009). It also does 
not address the problem of predictions for different trials, mean- 
ing that effects may be diluted (or strengthened) in unpredictable 
ways. 

On the standard measure of the composite task it has been 
shown that children as young as 3 months holistically process faces 
(Turati et al, 2010). In adults there is also a composite effect for 
human bodies (Robbins and Coltheart, 2012b) but there is no 
composite effect for car-fronts, novel objects, or dogs (the latter 
two even in experts; Cassia et al., 2009; Gauthier et al., 1998; Gau- 
thier and Tarr, 2002; Robbins and McKone, 2007). Interestingly, 
for bodies the composite effect is larger for left-right integra- 
tion, a form that has generally not been tested, compared to 
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FIGURE 3 | The original versions of the composite task: (A) the 
naming version of the task used for familiar faces. The top (or 

bottom) half of the composite is harder to identify when the halves are 
aligned (pictured left) as opposed to misaligned (pictured right). In this 



the more usual top-bottom integration (Robbins and Coltheart, 
2012b). Robbins and Coltheart argue that this may indicate an 
important role for holistic processing in non-identity judgments 
such as attractiveness and communication, because integration 
for these judgments may be more important between left and 
right halves. For attractiveness (Abbas and Duchaine, 2008) and 
emotional expression (arguably a form of communication; Calder 
et al, 2000; White, 2000; Tanaka et al., 2012) only top-bottom 
integration has been tested but does suggest holistic process- 
ing. There is also evidence to suggest that holistic processing is 
used when making judgments about the gender (Baudouin and 
Humphreys, 2006), age (Hole and George, 2011), race (Michel 



example the top half belongs to Kevin Rudd and the bottom half 
belongs to John Howard (both former Prime Ministers of Australia). (B) 
The same/different version using unfamiliar faces (where the top half is 
the half-to-match). 



et al., 2006), and perceived trustworthiness (Todorov et al., 2010) 
of a face. 

Another commonly used measure of holistic processing is the 
part-whole task (Tanaka and Farah, 1993). In this paradigm par- 
ticipants are required to become familiar with a face (e.g., Bill) 
and then asked to either identify which of two faces, that differ 
only by one feature, shows the learned person (e.g., Bill versus 
Bill with Jim's lips) or which of two facial features belongs to that 
particular person (e.g., Bill's mouth versus Jim's mouth). Partic- 
ipants are better at identifying features in the context of a whole 
face than in isolation, but not when faces are inverted or scram- 
bled (Tanaka and Farah, 1993). This is because holistic integration 
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creates the illusion that the new feature within the old face is a new 
face, making it easier to tell the two faces apart. However when the 
features are presented in isolation, or within an inverted or scram- 
bled face, they must be discriminated in a part-based fashion, 
which is harder. A matching version of the task produces simi- 
lar results (Davidoff and Donnelly, 1990; Donnelly and Davidoff, 
1999). 

A variation of the part- whole task involves also altering the spa- 
tial configuration between features (Tanaka and Sengco, 1997). By 
altering the spacing between the eyes, one's ability to accurately 
recognize changes in the other features of the face diminishes. 
This is thought to occur because altering one source of informa- 
tion within the holistic representation of a face (i.e., the spacing 
between features) detrimentally affects the perception of other 
parts (including the individual features; Tanaka and Sengco, 1997; 
Tanaka and Gordon, 2011). Thus this task provides evidence 
of the importance of configural information within the holistic 
representation of a face. 

Children show evidence of holistic processing on the spacing 
version of the part-whole task at age 4 (Pellicano et al., 2006). 
Both children aged 8-10 and adults also show similar sized part- 
whole effects for bodies as for faces (Seitz, 2002). Smaller or no 
part-whole effects are found for houses, chairs, novel objects, 
biological cells, and dog-faces (Davidoff and Donnelly, 1990; 
Tanaka and Farah, 1993; Tanaka et al, 1996, cited in Tanaka 
and Gauthier, 1997; Gauthier et al., 1998; Donnelly and Davidoff, 
1999). 

Overall, standard measures of holistic processing show effects 
for faces and perhaps human bodies but much smaller or no effect 
for other objects. They also show that holistic processing may 
develop reasonably early. 

LESS COMMONLY USED MEASURES OF HOLISTIC PROCESSING 

Other measures of holistic processing have either targeted the 
effects of masking or isolating specific regions of the face while 
making different types of discriminative judgments, or have 
explored the effect of inversion on discriminating between or 
detecting faces that have undergone different manipulations. 

Tasks that involve masking, isolating, or drawing attention 
toward specific local regions of the face (e.g., nose region, eye- 
brow region) can provide a useful indication as to how much 
participants rely on the feature(s) in these regions to make dif- 
ferent types of face-specific judgments (e.g., Gosselin and Schyns, 
2001; Robbins and McKone, 2003; Sekuler et al, 2004; Santos and 
Young, 2011). The same process can also be applied to broader 
facial regions that contain more configural information such as 
the internal (center area of the face where the most salient fea- 
tures are located) and external (outer area of the face including 
the chin, forehead, hairline, and ears) regions of the face (Santos 
and Young, 2011). If accuracy is higher for whole face trials than 
isolated region trials then this is suggestive of holistic process- 
ing (Santos and Young, 2011). As noted earlier it is also possible 
to directly manipulate holistic versus feature processing by tech- 
niques such as blurring or spatial filtering (to remove feature 
information; McKone, 2004; Goffaux and Rossion, 2006) or scram- 
bling of features (to remove holistic configuration; Tanaka and 
Farah, 1993). 



Common measures of holistic processing produce effects for 
upright faces (e.g., Young et al., 1987; Tanaka and Farah, 1993), 
but effects are greatly reduced or absent for inverted faces. Because 
of this, information that becomes harder to detect when a face is 
displayed upside down (whether it is a single feature or more rela- 
tional/configural in nature) has been argued to form part of the 
holistic representation of that face (e.g., McKone and Yovel, 2009). 
Detecting changes to either the spacing (e.g., Goffaux and Rossion, 
2006) or feature shape information (see McKone and Yovel, 2009, 
for review), within a face is significantly harder to do when faces 
are inverted. The difference between upright and inverted faces 
with manipulations to spacing and/or features has then also been 
used to measure holistic processing in other-race faces, for exam- 
ple. Children show the ability to detect spacing changes to faces 
from 5 months of age (e.g., Bhatt et al., 2005), but larger changes 
are usually needed than for adults (see review in Mondloch and 
Thomson, 2008). 

Other paradigms such as the superimposed faces task (Mar- 
tini et al, 2006) and the Mooney face task (McKone, 2004) have 
also been used as measures of holistic processing, but because the 
nature of these tasks involve detecting faces (or salience differences 
between faces) instead of making different types of discrimi- 
nations between these faces, they could be argued to be more 
representative of a face detection stage, rather than a face discrim- 
ination stage of holistic processing (see Tsao and Livingstone, 2008 
for review). 

Overall, measures of holistic processing suggest faces and bod- 
ies are processed differently to other objects and there are a variety 
of ways of tapping into this difference. Further, holistic processing 
may be used for not only identity but also other judgments such as 
attractiveness, expression, age, gender, and social attributions such 
as trustworthiness. Robbins and Coltheart (2012b) particularly 
suggest that examining left-right integration as well as top-bottom 
integration could lead to important increases in understanding of 
the nature of holistic processing. They predict, for example that 
for attractiveness holistic processing may be stronger for left-right 
integration because of the importance of symmetry to judgments 
of attractiveness. Next we consider another aspect of faces that 
may greatly increase our understanding of holistic processing. 

WHAT CAN MOVING FACES TELL US ABOUT HOLISTIC 
PROCESSING? 

When we see faces and bodies in real life, they are usually engaged 
in some type of motion. However the vast majority of stud- 
ies using the above measures of holistic processing have only 
used static faces as stimuli. Although the role and effectiveness 
of both structural and dynamic information in recognizing faces 
has been explored extensively in the literature, very few studies 
have applied measures of configural and holistic processing to 
moving facial stimuli. In relation to making identity judgments 
from moving faces, there are only two inversion effect stud- 
ies focusing on famous faces engaging in conversation (Knight 
and Johnston, 1997; Lander et al., 1999) and only one study 
that has applied the composite task to unfamiliar faces engag- 
ing in rigid motion (Xiao et al., 2012). In addition to this 
there has been one study that has explored inversion effects for 
subtle facial expression processing for static and moving faces 
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(Ambadar et al, 2005). Using moving, as opposed to static, 
facial stimuli on measures of holistic processing may help to 
clarify conceptual issues regarding what is included in the holis- 
tic representation of a face; improve the ease with which the 
components of a face are processed as a whole and; enhance 
face perception by providing additional information about a 
face. 

Facial motion can be rigid or non-rigid and in most cases 
human interactions involve a combination of both types of move- 
ment (OToole et al, 2002; Roark et al., 2003). Rigid motion 
includes turning, nodding, and shaking of the head (Roark et al, 
2003). Rigid motion is also apparent when moving past a still 
person/object. Non-rigid motion refers to the elastic change that 
can occur within the features of a face (e.g., moving the lips in 
conversation; O'Toole et al, 2002; Roark et al., 2003). 

Within the unfamiliar facial recognition literature there are 
conflicting results concerning how much additional information 
movement provides for face recognition, over the use of still 
images. Some studies have found no such motion advantage for 
any type of facial movement (e.g., Christie and Bruce, 1998), while 
others have found a beneficial role for rigid motion (e.g., Pike et al., 
1997), non-rigid motion (e.g., Thornton and Kourtzi, 2002), or 
both types of motion (e.g., Lander and Bruce, 2003). Differences 
in these results may, in part, be explained by the different types 
of tasks used. These included old/new recognition memory tasks 
(e.g., Lander and Bruce, 2003), sequential matching tasks (Thorn- 
ton and Kourtzi, 2002), and a "delayed visual search paradigm" 
(Pilzetal, 2006, 2009). 

Only one study has applied a direct measure of holistic pro- 
cessing to moving faces, and focused only on rigidly moving heads 
during the learning phase of the experiment (i.e., the test phase 
included only static faces; Xiao et al., 2012). This study found 
no composite effect for temporally coherent rigid head move- 
ment (heads rotating in view) but found composite effects for 
both temporally incoherent (random frames of different views) 
and temporally separate (visual noise separating frames) photo 
sequences. From these findings Xiao and colleagues concluded 
that people were more likely to use part-based than holistic 
processing in recognition tasks involving rigid head movement. 
A different study focusing on holistic processing and changes 
in facial viewpoint in static faces arrived at a similar conclu- 
sion, finding an increased reliance on part-based processing for 
some facial viewpoints but not others using inversion and fea- 
ture scrambling techniques and displaying these in the periphery 
(McKone, 2008). However, McKone (2008) also found no differ- 
ence in holistic processing at three different viewpoints [frontal 
(0°), three-quarter (45°), or profile (90°)] using the composite 
task. Together these results, although not conclusive, suggest that 
rigidly moving faces may produce weaker effects on measures 
of holistic processing than static faces. This may result from a 
greater dependence on featural (part-based) information in iden- 
tifying faces due to apparent changes within its second-order 
configuration. 

Indirect evidence of holistic processing for moving face stim- 
uli has also been found using the inversion task (Knight and 
Johnston, 1997; Lander et al, 1999). These studies found similar 



sized inversion effects when participants either identified celebri- 
ties from still frames or from dynamic video sequences, suggesting 
that holistic processing may operate in similar ways for static and 
dynamic faces. Inversion effects have also been found for moving 
faces in judging subtle facial expression (Ambadar et al, 2005). 
However the size of the inversion effect was no different to that 
found when the same expression judgments were made from sta- 
tic faces. Overall these studies do not suggest increased holistic 
processing for moving faces but more research is needed in the 
area. 

Using rigid and non-rigid moving faces in conjunction with 
common measures of holistic processing may help to clarify con- 
ceptual issues regarding what is included in the holistic repre- 
sentation of a face (c.f. McKone and Yovel, 2009). There would 
be challenges in using standard measures with moving faces 
(e.g., getting facial halves from two different people to remain 
aligned as well as move in unison with one another). How- 
ever existing paradigms would serve as a useful starting point 
in the exploration of holistic processing for moving faces and 
the challenges that arise from this task may give way to alternate 
measures. 

Regardless of whether the holistic representation of a face is 
thought to be inclusive of feature shape (McKone and Yovel, 2009; 
Riesenhuber and Wolff, 2009; Yovel, 2009) or not (Rossion, 2008), 
any motion that includes viewpoint change (e.g., turning the head) 
should lead to apparent changes in the configuration of a face 
(Compare Figures 4A,B). Based on this one might predict that 
a turning or nodding face, as opposed to a static one, may lead 
to some sort of quantitative change on measures of holistic and 
configural processing. However predictions about the effect of 
feature (as opposed to whole head) movement on holistic process- 
ing would depend on whether feature shape was included in the 
holistic representation of a face (c.f. McKone and Yovel, 2009). To 
provide an example of this, imagine a situation in which an unfa- 
miliar person yawns while you are processing their face (Compare 
Figures 4A,C). You might expect to see their mouth widening 
and their eyes narrowing (among other changes) as they begin to 
yawn. If the holistic representation of a face was inclusive of fea- 
ture shape and was measured from feature boundaries, you would 
expect several changes to occur in the second-order configuration 
of that face. For instance the lower mouth boundary (the bottom 
lip) would move further away from the nose and eye boundaries. 
However if feature shape was not included in the holistic repre- 
sentation of a face (c.f. Rossion, 2008), and relational information 
was instead measured from the center-points of shapeless features, 
then little change would occur in the second-order configuration 
of that face. The center-point of the mouth would change little 
in relation to other feature center-points regardless of how far 
the mouth was opened. If the holistic representation of a face 
is inclusive of changes in feature shape then one might expect 
to see some sort of quantitative difference between faces with 
moving features and still faces on measures of holistic and con- 
figural processing. If the holistic representation of a face is not 
inclusive of changes in feature shape then one might expect to 
see little or no quantitative difference between these two types of 
stimuli. 
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FIGURE 4 | Examples of unaltered, feature shape inclusive and feature 
center point only conceptual models [based on models proposed by 
McKone and Yovel (2009)] for (A) a neutral face, (B) a turning face, and 
(C) a yawning face. Black squares represent key points from which configura 



information is calculated, black lines represent configural information, white 
circles represent other contour points, and white lines follow the shapes of 
the features. Center-point only images have been blurred to emphasize that 
feature shape does not factor into the holistic representation in this model. 



Using moving as opposed to static faces on these measures 
may also allow the face to be perceived more easily and effectively 
as a whole due to the applicability of additional Gestalt grouping 
principals specific to moving stimuli. When faces are still, their fea- 
tures possess a range of properties that allow them to be grouped 
together effectively as a whole. These include the "proximity" of the 
features within the face, the "common region" they share (i.e., the 
face itself) and the approximate bilateral "symmetry" generated 



by the shape, positioning, and spacing of facial features when they 
are seen from a frontal viewpoint. When faces engage in motion 
two additional Gestalt grouping principles can be applied to their 
features. The first is the "common fate" shared by the facial fea- 
tures when they move in the same direction and at the same pace 
during rigid motion and the second is the "synchrony" gener- 
ated by the movement of different facial features at the same time 
(but in different ways) during non-rigid motion (see Wagemans 
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et al., 2012a, for a comprehensive overview of Gestalt grouping 
principles). 

In line with the "representation enhancement hypothesis" 
(O'Toole et al, 2002; Roark et al, 2003), it is also possible that the 
use of moving faces on these measures may enhance facial percep- 
tion and recognition in a variety of other ways. Rigid motion may 
assist in holistic processing because it not only provides additional 
exemplars of that face through viewpoint change but may also 
allow for a 3D representation of that face to be developed, allow- 
ing it to be identified from previously unseen viewpoints (Christie 
and Bruce, 1998). Rigid and non-rigid motion may also assist in 
facial processing by revealing information about how much the 
internal features of a face and their spatial configuration can vary 
during different types of movement (Christie and Bruce, 1998). 
For example when a face is engaged in conversation (with minimal 
to no head movement) the distance between the upper and lower 
boundaries of the mouth would vary considerably whereas the 
distance between inner corners of the eyes would remain invari- 
ant. However during head rotation along the yaw axis the distance 
between the upper and lower mouth boundaries would remain 
relatively invariant while the apparent distance between the eyes 
would change. 

Studying faces in motion, therefore will allow us to determine 
the information most relevant to the perception of a face as a 
whole. Exploring the joint contribution of facial structure and 
motion to holistic processing will also allow facial processing to be 
explored in a more naturalistic context. This will potentially lead 
to a clearer understanding about how holistic processing operates 
in the real world. If holistic processing was found to operate in a 
similar way for both still and moving faces then such experiments 



would serve to strengthen the validity of existing research in the 
field of facial processing. However if holistic processing for mov- 
ing faces was somehow different from what has previously been 
found for static faces, then the ecological validity of existing find- 
ings with static faces would be put into question and theories of 
face recognition might need to be rethought. 

CONCLUSION 

This review article has traced the routes of holistic processing 
from Gestalt theory, distinguishing between commonly referenced 
models of face perception and how they differ from a purely part- 
based model of perception. Despite the wide array of research in 
the face perception literature, further clarification is needed in the 
field. We have reviewed both common and lesser known measures 
of holistic processing. These measures support a holistic/part- 
based model of face perception whereby holistic and part-based 
processing make parallel and separable contributions to face per- 
ception. Both the individual features and the holistic represen- 
tation of a face appear relevant to face perception, however it 
is still uncertain what is included in the holistic representation. 
Establishing whether feature shape plays a role in the holistic 
representation of a face will help determine whether configural 
information is measured from the center-points of shapeless blobs 
or from key points surrounding the featural boundaries. Using 
moving facial stimuli with existing measures of holistic processing 
should provide a useful means of answering this question. The 
use of moving faces in this field should also allow facial process- 
ing to be explored in a way more similar to how it occurs in the 
real world where faces relay socially relevant information through 
movement. 
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